40 research outputs found
Categorical data analysis using a skewed Weibull regression model
In this paper, we present a Weibull link (skewed) model for categorical
response data arising from binomial as well as multinomial model. We show that,
for such types of categorical data, the most commonly used models (logit,
probit and complementary log-log) can be obtained as limiting cases. We further
compare the proposed model with some other asymmetrical models. The Bayesian as
well as frequentist estimation procedures for binomial and multinomial data
responses are presented in details. The analysis of two data sets to show the
efficiency of the proposed model is performed
Confidence Statements for Ordering Quantiles
This work proposes Quor, a simple yet effective nonparametric method to
compare independent samples with respect to corresponding quantiles of their
populations. The method is solely based on the order statistics of the samples,
and independence is its only requirement. All computations are performed using
exact distributions with no need for any asymptotic considerations, and yet can
be run using a fast quadratic-time dynamic programming idea. Computational
performance is essential in high-dimensional domains, such as gene expression
data. We describe the approach and discuss on the most important assumptions,
building a parallel with assumptions and properties of widely used techniques
for the same problem. Experiments using real data from biomedical studies are
performed to empirically compare Quor and other methods in a classification
task over a selection of high-dimensional data sets
The Likelihood Ratio Test and Full Bayesian Significance Test under small sample sizes for contingency tables
Hypothesis testing in contingency tables is usually based on asymptotic
results, thereby restricting its proper use to large samples. To study these
tests in small samples, we consider the likelihood ratio test and define an
accurate index, the P-value, for the celebrated hypotheses of homogeneity,
independence, and Hardy-Weinberg equilibrium. The aim is to understand the use
of the asymptotic results of the frequentist Likelihood Ratio Test and the
Bayesian FBST -- Full Bayesian Significance Test -- under small-sample
scenarios. The proposed exact P-value is used as a benchmark to understand the
other indices. We perform analysis in different scenarios, considering
different sample sizes and different table dimensions. The exact Fisher test
for tables that drastically reduces the sample space is also
discussed. The main message of this paper is that all indices have very similar
behavior, so the tests based on asymptotic results are very good to be used in
any circumstance, even with small sample sizes
Ordering Quantiles through Confidence Statements
Ranking variables according to their relevance to predict an outcome is an important task in biomedicine. For instance, such ranking can be used for selecting a smaller number of genes for then applying other sophisticated experiments only on genes identified as important. A nonparametric method called Quor is designed to provide a confidence value for the order of arbitrary quantiles of different populations using independent samples. This confidence may provide insights about possible differences among groups and yields a ranking of importance for the variables. Computations are efficient and use exact distributions with no need for asymptotic considerations. Experiments with simulated data and with multiple real -omics data sets are performed, and they show advantages and disadvantages of the method. Quor has no assumptions but independence of samples, thus it might be a better option when assumptions of other methods cannot be asserted. The software is publicly available on CRAN
Chromosomal Damage and Apoptosis in Exfoliated Buccal Cells from Individuals with Oral Cancer
This study aimed to investigate cytological abnormalities indicative of chromosome damage (micronuclei) and apoptosis (karyorrhexis, pyknosis, and condensed chromatin) in exfoliated cells from the buccal mucosa of patients with oral cancer and control subjects. The sample included twenty individuals with oral cancer and forty individuals with normal buccal mucosa. Material was collected from the cheek epithelium in areas with lesions and areas without abnormalities. A minimum of one thousand cells was analyzed. Micronuclei were found significantly more frequently in cells collected from lesions than in cells from normal areas, independent of the presence/absence of cancer (P < 0.0001). They were also significantly more frequent in smokers and in mouthwash users (P < 0.0001). Apoptosis occurred significantly less frequently in individuals with oral cancer (P < 0.0001). These results show that oral cancer is associated with higher frequency of chromosomal damage and suggest that apoptosis is compromised in the buccal cells of individuals with this kind of neoplasia